Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 23699 |
| Missing cells | 101441 |
| Missing cells (%) | 18.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 8.2 MiB |
| Average record size in memory | 364.3 B |
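The missing-cell percentage in the table above can be rechecked from the other rows: it is the number of missing cells divided by the total number of cells (variables × observations). A minimal sketch, where the function name `missing_cell_share` is illustrative and not part of pandas-profiling:

```python
def missing_cell_share(n_missing, n_vars, n_obs):
    """Return missing cells as a percentage of all cells in the table."""
    total_cells = n_vars * n_obs
    return 100.0 * n_missing / total_cells


# Figures copied from the Dataset statistics table.
share = missing_cell_share(101441, 23, 23699)
print(f"{share:.1f}%")
```

With the reported counts this rounds to the 18.6% shown in the table.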
Variable types
| Numeric | 16 |
|---|---|
| Categorical | 4 |
| Boolean | 3 |
Alerts
| Alert | Type |
|---|---|
| first_day_exposition has a high cardinality: 1491 distinct values | High cardinality |
| locality_name has a high cardinality: 364 distinct values | High cardinality |
| last_price is highly correlated with total_area and 4 other fields | High correlation |
| total_area is highly correlated with last_price and 3 other fields | High correlation |
| rooms is highly correlated with last_price and 2 other fields | High correlation |
| ceiling_height is highly correlated with last_price and 1 other field | High correlation |
| floors_total is highly correlated with floor | High correlation |
| living_area is highly correlated with last_price and 2 other fields | High correlation |
| floor is highly correlated with floors_total | High correlation |
| kitchen_area is highly correlated with last_price and 2 other fields | High correlation |
| last_price is highly correlated with total_area and 2 other fields | High correlation |
| total_area is highly correlated with last_price and 3 other fields | High correlation |
| rooms is highly correlated with total_area and 1 other field | High correlation |
| floors_total is highly correlated with floor | High correlation |
| living_area is highly correlated with last_price and 2 other fields | High correlation |
| floor is highly correlated with floors_total | High correlation |
| kitchen_area is highly correlated with last_price and 1 other field | High correlation |
| last_price is highly correlated with total_area | High correlation |
| total_area is highly correlated with last_price and 2 other fields | High correlation |
| rooms is highly correlated with total_area and 1 other field | High correlation |
| living_area is highly correlated with total_area and 1 other field | High correlation |
| last_price is highly correlated with total_area and 2 other fields | High correlation |
| total_area is highly correlated with last_price and 3 other fields | High correlation |
| rooms is highly correlated with last_price and 3 other fields | High correlation |
| floors_total is highly correlated with floor | High correlation |
| living_area is highly correlated with last_price and 3 other fields | High correlation |
| floor is highly correlated with floors_total | High correlation |
| kitchen_area is highly correlated with total_area and 2 other fields | High correlation |
| airports_nearest is highly correlated with cityCenters_nearest | High correlation |
| cityCenters_nearest is highly correlated with airports_nearest | High correlation |
| parks_around3000 is highly correlated with parks_nearest | High correlation |
| parks_nearest is highly correlated with parks_around3000 | High correlation |
| ceiling_height has 9195 (38.8%) missing values | Missing |
| living_area has 1903 (8.0%) missing values | Missing |
| is_apartment has 20924 (88.3%) missing values | Missing |
| kitchen_area has 2278 (9.6%) missing values | Missing |
| balcony has 11519 (48.6%) missing values | Missing |
| airports_nearest has 5542 (23.4%) missing values | Missing |
| cityCenters_nearest has 5519 (23.3%) missing values | Missing |
| parks_around3000 has 5518 (23.3%) missing values | Missing |
| parks_nearest has 15620 (65.9%) missing values | Missing |
| ponds_around3000 has 5518 (23.3%) missing values | Missing |
| ponds_nearest has 14589 (61.6%) missing values | Missing |
| days_exposition has 3181 (13.4%) missing values | Missing |
| last_price is highly skewed (γ1 = 25.80427519) | Skewed |
| ceiling_height is highly skewed (γ1 = 41.70907732) | Skewed |
| Unnamed: 0 is uniformly distributed | Uniform |
| Unnamed: 0 has unique values | Unique |
| total_images has 1059 (4.5%) zeros | Zeros |
| balcony has 3758 (15.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-04-16 19:42:09.130055 |
|---|---|
| Analysis finished | 2022-04-16 19:43:02.950568 |
| Duration | 53.82 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ≥0)
| Distinct | 23699 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11849 |
| Minimum | 0 |
| Maximum | 23698 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1184.9 |
| Q1 | 5924.5 |
| median | 11849 |
| Q3 | 17773.5 |
| 95-th percentile | 22513.1 |
| Maximum | 23698 |
| Range | 23698 |
| Interquartile range (IQR) | 11849 |
Descriptive statistics
| Standard deviation | 6841.456351 |
|---|---|
| Coefficient of variation (CV) | 0.5773868133 |
| Kurtosis | -1.2 |
| Mean | 11849 |
| Median Absolute Deviation (MAD) | 5925 |
| Skewness | 0 |
| Sum | 280809451 |
| Variance | 46805525 |
| Monotonicity | Strictly increasing |
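The statistics above are exactly what a plain row index 0, 1, ..., n-1 would produce, which suggests that this column (flagged in the alerts as `Unnamed: 0`, unique and uniformly distributed) is a saved index rather than a feature. A small sketch that checks this with closed-form expressions; the helper name `index_column_stats` is illustrative, and the variance uses the sample (n-1) convention:

```python
def index_column_stats(n):
    """Closed-form statistics for the integer sequence 0, 1, ..., n-1."""
    total = n * (n - 1) // 2        # sum of 0..n-1
    mean = (n - 1) / 2
    variance = n * (n + 1) / 12     # sample variance (divides by n-1)
    std = variance ** 0.5
    return {"sum": total, "mean": mean, "variance": variance, "cv": std / mean}


# n is the number of observations reported for the dataset.
stats = index_column_stats(23699)
print(stats)
```

The sum, mean, variance, and coefficient of variation computed this way agree with the reported values, so dropping the column before modelling is a reasonable option.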
| Value | Count | Frequency (%) |
| 2047 | 1 | < 0.1% |
| 661 | 1 | < 0.1% |
| 19084 | 1 | < 0.1% |
| 17037 | 1 | < 0.1% |
| 23182 | 1 | < 0.1% |
| 21135 | 1 | < 0.1% |
| 10896 | 1 | < 0.1% |
| 8849 | 1 | < 0.1% |
| 14994 | 1 | < 0.1% |
| 12947 | 1 | < 0.1% |
| Other values (23689) | 23689 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 23698 | 1 | |
| 23697 | 1 | |
| 23696 | 1 | |
| 23695 | 1 | |
| 23694 | 1 | |
| 23693 | 1 | |
| 23692 | 1 | |
| 23691 | 1 | |
| 23690 | 1 | |
| 23689 | 1 |
total_images
Real number (ℝ≥0)
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.858475041 |
| Minimum | 0 |
| Maximum | 50 |
| Zeros | 1059 |
| Zeros (%) | 4.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 9 |
| Q3 | 14 |
| 95-th percentile | 20 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 5.682528956 |
|---|---|
| Coefficient of variation (CV) | 0.5764105435 |
| Kurtosis | -0.3359698028 |
| Mean | 9.858475041 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.2585928567 |
| Sum | 233636 |
| Variance | 32.29113534 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 1798 | 7.6% |
| 9 | 1725 | 7.3% |
| 20 | 1694 | 7.1% |
| 8 | 1585 | 6.7% |
| 7 | 1521 | 6.4% |
| 6 | 1482 | 6.3% |
| 11 | 1362 | 5.7% |
| 5 | 1301 | 5.5% |
| 12 | 1225 | 5.2% |
| 0 | 1059 | 4.5% |
| Other values (28) | 8947 |
| Value | Count | Frequency (%) |
| 0 | 1059 | |
| 1 | 872 | |
| 2 | 640 | 2.7% |
| 3 | 769 | |
| 4 | 986 | |
| 5 | 1301 | |
| 6 | 1482 | |
| 7 | 1521 | |
| 8 | 1585 | |
| 9 | 1725 |
| Value | Count | Frequency (%) |
| 50 | 3 | |
| 42 | 1 | < 0.1% |
| 39 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 35 | 2 | |
| 32 | 4 | |
| 31 | 2 | |
| 30 | 2 | |
| 29 | 3 | |
| 28 | 4 |
last_price
Real number (ℝ≥0)
HIGH CORRELATION, SKEWED
| Distinct | 2978 |
|---|---|
| Distinct (%) | 12.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6541548.772 |
| Minimum | 12190 |
| Maximum | 763000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 12190 |
|---|---|
| 5-th percentile | 1870000 |
| Q1 | 3400000 |
| median | 4650000 |
| Q3 | 6800000 |
| 95-th percentile | 15300000 |
| Maximum | 763000000 |
| Range | 762987810 |
| Interquartile range (IQR) | 3400000 |
Descriptive statistics
| Standard deviation | 10887013.27 |
|---|---|
| Coefficient of variation (CV) | 1.664286799 |
| Kurtosis | 1277.682584 |
| Mean | 6541548.772 |
| Median Absolute Deviation (MAD) | 1500000 |
| Skewness | 25.80427519 |
| Sum | 1.550281643 × 10^11 |
| Variance | 1.185270579 × 10^14 |
| Monotonicity | Not monotonic |
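The skewness of 25.8 and a maximum more than 100 times the third quartile point to a small number of extreme prices. One common way to bound the "typical" range is Tukey's rule, which is an assumption here and not something the report itself applies. A minimal sketch using the quartiles reported above (the function name `tukey_fences` is illustrative):

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences: Q1 - k*IQR and Q3 + k*IQR."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Q1 and Q3 of last_price from the quantile statistics.
low, high = tukey_fences(3_400_000, 6_800_000)
print(low, high)
```

Under this rule the upper fence falls below the reported 95th percentile, so more than 5% of listings would count as outliers; a log transform of the price may be a gentler alternative to dropping them.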
| Value | Count | Frequency (%) |
| 4500000 | 342 | 1.4% |
| 3500000 | 291 | 1.2% |
| 4000000 | 260 | 1.1% |
| 4300000 | 260 | 1.1% |
| 4200000 | 259 | 1.1% |
| 3600000 | 257 | 1.1% |
| 3300000 | 244 | 1.0% |
| 3800000 | 240 | 1.0% |
| 3200000 | 238 | 1.0% |
| 3700000 | 234 | 1.0% |
| Other values (2968) | 21074 |
| Value | Count | Frequency (%) |
| 12190 | 1 | < 0.1% |
| 430000 | 2 | |
| 440000 | 1 | < 0.1% |
| 450000 | 4 | |
| 470000 | 3 | |
| 480000 | 1 | < 0.1% |
| 490000 | 2 | |
| 500000 | 4 | |
| 520000 | 1 | < 0.1% |
| 530000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 763000000 | 1 | |
| 420000000 | 1 | |
| 401300000 | 1 | |
| 330000000 | 1 | |
| 300000000 | 1 | |
| 289238400 | 1 | |
| 245000000 | 1 | |
| 240000000 | 1 | |
| 230000000 | 1 | |
| 190870000 | 1 |
total_area
Real number (ℝ≥0)
| Distinct | 2182 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.348651 |
| Minimum | 12 |
| Maximum | 900 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 12 |
|---|---|
| 5-th percentile | 31 |
| Q1 | 40 |
| median | 52 |
| Q3 | 69.9 |
| 95-th percentile | 116 |
| Maximum | 900 |
| Range | 888 |
| Interquartile range (IQR) | 29.9 |
Descriptive statistics
| Standard deviation | 35.6540829 |
|---|---|
| Coefficient of variation (CV) | 0.5908016553 |
| Kurtosis | 47.52149263 |
| Mean | 60.348651 |
| Median Absolute Deviation (MAD) | 13.9 |
| Skewness | 4.768597224 |
| Sum | 1430202.68 |
| Variance | 1271.213628 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 45 | 419 | 1.8% |
| 42 | 383 | 1.6% |
| 60 | 347 | 1.5% |
| 31 | 346 | 1.5% |
| 44 | 345 | 1.5% |
| 40 | 315 | 1.3% |
| 43 | 301 | 1.3% |
| 32 | 289 | 1.2% |
| 46 | 282 | 1.2% |
| 36 | 280 | 1.2% |
| Other values (2172) | 20392 |
| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 13 | 3 | |
| 13.2 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 15 | 2 | |
| 15.5 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 2 | |
| 17.2 | 1 | < 0.1% |
| 17.6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 900 | 1 | |
| 631.2 | 1 | |
| 631 | 1 | |
| 618 | 1 | |
| 590 | 1 | |
| 517 | 1 | |
| 507 | 1 | |
| 500 | 2 | |
| 495 | 1 | |
| 494.1 | 1 |
first_day_exposition
Categorical
| Distinct | 1491 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 118 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 2019-03-07T00:00:00 |
|---|---|
| 2nd row | 2018-12-04T00:00:00 |
| 3rd row | 2015-08-20T00:00:00 |
| 4th row | 2015-07-24T00:00:00 |
| 5th row | 2018-06-19T00:00:00 |
Common Values
| Value | Count | Frequency (%) |
| 2018-02-01T00:00:00 | 368 | 1.6% |
| 2017-11-10T00:00:00 | 240 | 1.0% |
| 2017-10-13T00:00:00 | 124 | 0.5% |
| 2017-09-27T00:00:00 | 111 | 0.5% |
| 2018-03-26T00:00:00 | 97 | 0.4% |
| 2018-07-10T00:00:00 | 93 | 0.4% |
| 2017-09-28T00:00:00 | 74 | 0.3% |
| 2018-03-06T00:00:00 | 72 | 0.3% |
| 2018-02-08T00:00:00 | 71 | 0.3% |
| 2018-02-20T00:00:00 | 70 | 0.3% |
| Other values (1481) | 22379 |
Words
| Value | Count | Frequency (%) |
| 2018-02-01t00:00:00 | 368 | 1.6% |
| 2017-11-10t00:00:00 | 240 | 1.0% |
| 2017-10-13t00:00:00 | 124 | 0.5% |
| 2017-09-27t00:00:00 | 111 | 0.5% |
| 2018-03-26t00:00:00 | 97 | 0.4% |
| 2018-07-10t00:00:00 | 93 | 0.4% |
| 2017-09-28t00:00:00 | 74 | 0.3% |
| 2018-03-06t00:00:00 | 72 | 0.3% |
| 2018-02-08t00:00:00 | 71 | 0.3% |
| 2018-02-20t00:00:00 | 70 | 0.3% |
| Other values (1481) | 22379 |
rooms
Real number (ℝ≥0)
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.070635892 |
| Minimum | 0 |
| Maximum | 19 |
| Zeros | 197 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.078404851 |
|---|---|
| Coefficient of variation (CV) | 0.5208085377 |
| Kurtosis | 8.689136218 |
| Mean | 2.070635892 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.524982284 |
| Sum | 49072 |
| Variance | 1.162957022 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 8047 | 34.0% |
| 2 | 7940 | 33.5% |
| 3 | 5814 | 24.5% |
| 4 | 1180 | 5.0% |
| 5 | 326 | 1.4% |
| 0 | 197 | 0.8% |
| 6 | 105 | 0.4% |
| 7 | 59 | 0.2% |
| 8 | 12 | 0.1% |
| 9 | 8 | < 0.1% |
| Other values (7) | 11 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 197 | 0.8% |
| 1 | 8047 | 34.0% |
| 2 | 7940 | 33.5% |
| 3 | 5814 | 24.5% |
| 4 | 1180 | 5.0% |
| 5 | 326 | 1.4% |
| 6 | 105 | 0.4% |
| 7 | 59 | 0.2% |
| 8 | 12 | 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 19 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 14 | 2 | < 0.1% |
| 12 | 1 | < 0.1% |
| 11 | 2 | < 0.1% |
| 10 | 3 | < 0.1% |
| 9 | 8 | < 0.1% |
| 8 | 12 | 0.1% |
| 7 | 59 | 0.2% |
ceiling_height
Real number (ℝ≥0)
| Distinct | 183 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 9195 |
| Missing (%) | 38.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.771498897 |
| Minimum | 1 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 2.52 |
| median | 2.65 |
| Q3 | 2.8 |
| 95-th percentile | 3.3 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 0.28 |
Descriptive statistics
| Standard deviation | 1.261055831 |
|---|---|
| Coefficient of variation (CV) | 0.4550085993 |
| Kurtosis | 2627.139521 |
| Mean | 2.771498897 |
| Median Absolute Deviation (MAD) | 0.15 |
| Skewness | 41.70907732 |
| Sum | 40197.82 |
| Variance | 1.590261809 |
| Monotonicity | Not monotonic |
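The interquartile range here is only 0.28 while the values span 1 to 100, so a few entries are likely data-entry errors rather than real ceiling heights. One way to flag them is the modified z-score built from the median and MAD reported above. This is a sketch under assumptions: the 0.6745 scaling and the 3.5 cutoff are the conventional choices for this score, not values taken from the report, and the sample values are picked from the extreme-value tables of this variable.

```python
def modified_z(x, median, mad):
    """Modified z-score: distance from the median in scaled MAD units."""
    return 0.6745 * (x - median) / mad


MEDIAN, MAD, CUTOFF = 2.65, 0.15, 3.5  # median and MAD from the report; cutoff assumed

# A handful of ceiling heights that appear in the report's tables.
values = [1, 2.5, 2.65, 3.3, 14, 27, 100]
flagged = [x for x in values if abs(modified_z(x, MEDIAN, MAD)) > CUTOFF]
print(flagged)
```

Flagged values still need a domain decision: an entry such as 27 could plausibly be 2.7 with a misplaced decimal point, whereas 100 is better treated as missing.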
| Value | Count | Frequency (%) |
| 2.5 | 3515 | 14.8% |
| 2.6 | 1646 | 6.9% |
| 2.7 | 1574 | 6.6% |
| 3 | 1112 | 4.7% |
| 2.8 | 993 | 4.2% |
| 2.55 | 980 | 4.1% |
| 2.75 | 910 | 3.8% |
| 2.65 | 676 | 2.9% |
| 3.2 | 277 | 1.2% |
| 3.1 | 203 | 0.9% |
| Other values (173) | 2618 | 11.0% |
| (Missing) | 9195 | 38.8% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 1.2 | 1 | < 0.1% |
| 1.75 | 1 | < 0.1% |
| 2 | 11 | |
| 2.2 | 1 | < 0.1% |
| 2.25 | 1 | < 0.1% |
| 2.3 | 4 | < 0.1% |
| 2.34 | 1 | < 0.1% |
| 2.4 | 23 | |
| 2.45 | 15 |
| Value | Count | Frequency (%) |
| 100 | 1 | < 0.1% |
| 32 | 2 | < 0.1% |
| 27.5 | 1 | < 0.1% |
| 27 | 8 | |
| 26 | 1 | < 0.1% |
| 25 | 7 | |
| 24 | 1 | < 0.1% |
| 22.6 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
floors_total
Real number (ℝ≥0)
| Distinct | 36 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 86 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.67382374 |
| Minimum | 1 |
| Maximum | 60 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 9 |
| Q3 | 16 |
| 95-th percentile | 25 |
| Maximum | 60 |
| Range | 59 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 6.597172989 |
|---|---|
| Coefficient of variation (CV) | 0.6180702576 |
| Kurtosis | 0.04464170398 |
| Mean | 10.67382374 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.9402749458 |
| Sum | 252041 |
| Variance | 43.52269145 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 5788 | |
| 9 | 3761 | |
| 16 | 1376 | 5.8% |
| 12 | 1362 | 5.7% |
| 4 | 1200 | 5.1% |
| 10 | 1174 | 5.0% |
| 25 | 1075 | 4.5% |
| 6 | 914 | 3.9% |
| 17 | 833 | 3.5% |
| 3 | 668 | 2.8% |
| Other values (26) | 5462 |
| Value | Count | Frequency (%) |
| 1 | 25 | 0.1% |
| 2 | 383 | 1.6% |
| 3 | 668 | 2.8% |
| 4 | 1200 | 5.1% |
| 5 | 5788 | |
| 6 | 914 | 3.9% |
| 7 | 592 | 2.5% |
| 8 | 390 | 1.6% |
| 9 | 3761 | |
| 10 | 1174 | 5.0% |
| Value | Count | Frequency (%) |
| 60 | 1 | < 0.1% |
| 52 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 36 | 3 | < 0.1% |
| 35 | 24 | 0.1% |
| 34 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 21 | 0.1% |
| 27 | 164 |
living_area
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 1782 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 1903 |
| Missing (%) | 8.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.45785243 |
| Minimum | 2 |
| Maximum | 409.7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 15.2 |
| Q1 | 18.6 |
| median | 30 |
| Q3 | 42.3 |
| 95-th percentile | 69 |
| Maximum | 409.7 |
| Range | 407.7 |
| Interquartile range (IQR) | 23.7 |
Descriptive statistics
| Standard deviation | 22.03044522 |
|---|---|
| Coefficient of variation (CV) | 0.6393446969 |
| Kurtosis | 31.36088975 |
| Mean | 34.45785243 |
| Median Absolute Deviation (MAD) | 11.8 |
| Skewness | 3.909429763 |
| Sum | 751043.3515 |
| Variance | 485.3405164 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 18 | 882 | 3.7% |
| 17 | 675 | 2.8% |
| 30 | 598 | 2.5% |
| 16 | 486 | 2.1% |
| 20 | 481 | 2.0% |
| 28 | 423 | 1.8% |
| 31 | 381 | 1.6% |
| 19 | 329 | 1.4% |
| 32 | 320 | 1.4% |
| 29 | 319 | 1.3% |
| Other values (1772) | 16902 | |
| (Missing) | 1903 | 8.0% |
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 3 | 2 | |
| 5 | 1 | |
| 5.4 | 1 | |
| 6 | 1 | |
| 6.5 | 1 | |
| 8 | 2 | |
| 8.3 | 1 | |
| 8.4 | 1 | |
| 8.5 | 1 |
| Value | Count | Frequency (%) |
| 409.7 | 1 | |
| 409 | 1 | |
| 347.5 | 1 | |
| 332 | 1 | |
| 322.3 | 1 | |
| 312.5 | 1 | |
| 301.5 | 1 | |
| 300 | 1 | |
| 279.6 | 1 | |
| 274 | 1 |
floor
Real number (ℝ≥0)
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.892358327 |
| Minimum | 1 |
| Maximum | 33 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 16 |
| Maximum | 33 |
| Range | 32 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 4.885249206 |
|---|---|
| Coefficient of variation (CV) | 0.8290821662 |
| Kurtosis | 2.32865486 |
| Mean | 5.892358327 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.5531408 |
| Sum | 139643 |
| Variance | 23.86565981 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 3368 | |
| 3 | 3073 | |
| 1 | 2917 | |
| 4 | 2804 | |
| 5 | 2621 | |
| 6 | 1305 | 5.5% |
| 7 | 1218 | 5.1% |
| 8 | 1083 | 4.6% |
| 9 | 1051 | 4.4% |
| 10 | 687 | 2.9% |
| Other values (23) | 3572 |
| Value | Count | Frequency (%) |
| 1 | 2917 | |
| 2 | 3368 | |
| 3 | 3073 | |
| 4 | 2804 | |
| 5 | 2621 | |
| 6 | 1305 | 5.5% |
| 7 | 1218 | 5.1% |
| 8 | 1083 | 4.6% |
| 9 | 1051 | 4.4% |
| 10 | 687 | 2.9% |
| Value | Count | Frequency (%) |
| 33 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 10 | < 0.1% |
| 26 | 24 | 0.1% |
| 25 | 46 | |
| 24 | 63 |
is_apartment
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 20924 |
| Missing (%) | 88.3% |
| Memory size | 740.9 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 2725 | 11.5% |
| True | 50 | 0.2% |
| (Missing) | 20924 | 88.3% |
studio
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.3 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 23550 | 99.4% |
| True | 149 | 0.6% |
open_plan
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.3 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 23632 | 99.7% |
| True | 67 | 0.3% |
kitchen_area
Real number (ℝ≥0)
| Distinct | 971 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 2278 |
| Missing (%) | 9.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.5698072 |
| Minimum | 1.3 |
| Maximum | 112 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 1.3 |
|---|---|
| 5-th percentile | 5.5 |
| Q1 | 7 |
| median | 9.1 |
| Q3 | 12 |
| 95-th percentile | 20 |
| Maximum | 112 |
| Range | 110.7 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 5.905437934 |
|---|---|
| Coefficient of variation (CV) | 0.5587081981 |
| Kurtosis | 33.7611296 |
| Mean | 10.5698072 |
| Median Absolute Deviation (MAD) | 2.1 |
| Skewness | 4.209631534 |
| Sum | 226415.84 |
| Variance | 34.87419719 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 1300 | 5.5% |
| 10 | 1262 | 5.3% |
| 8 | 1110 | 4.7% |
| 9 | 1101 | 4.6% |
| 7 | 1062 | 4.5% |
| 11 | 797 | 3.4% |
| 12 | 662 | 2.8% |
| 8.5 | 415 | 1.8% |
| 5.5 | 400 | 1.7% |
| 14 | 381 | 1.6% |
| Other values (961) | 12931 | |
| (Missing) | 2278 | 9.6% |
| Value | Count | Frequency (%) |
| 1.3 | 1 | < 0.1% |
| 2 | 7 | |
| 2.3 | 1 | < 0.1% |
| 2.4 | 1 | < 0.1% |
| 2.89 | 1 | < 0.1% |
| 3 | 7 | |
| 3.2 | 1 | < 0.1% |
| 3.3 | 1 | < 0.1% |
| 3.4 | 1 | < 0.1% |
| 3.5 | 4 |
| Value | Count | Frequency (%) |
| 112 | 1 | |
| 107 | 1 | |
| 100.7 | 1 | |
| 100 | 1 | |
| 93.2 | 1 | |
| 93 | 1 | |
| 87.2 | 1 | |
| 77 | 2 | |
| 75 | 1 | |
| 72 | 1 |
balcony
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11519 |
| Missing (%) | 48.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.150082102 |
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 3758 |
| Zeros (%) | 15.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.071300393 |
|---|---|
| Coefficient of variation (CV) | 0.9314990568 |
| Kurtosis | 2.505785898 |
| Mean | 1.150082102 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.243099425 |
| Sum | 14008 |
| Variance | 1.147684532 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 4195 | 17.7% |
| 0 | 3758 | 15.9% |
| 2 | 3659 | 15.4% |
| 5 | 304 | 1.3% |
| 4 | 183 | 0.8% |
| 3 | 81 | 0.3% |
| (Missing) | 11519 | 48.6% |
| Value | Count | Frequency (%) |
| 0 | 3758 | |
| 1 | 4195 | |
| 2 | 3659 | |
| 3 | 81 | 0.3% |
| 4 | 183 | 0.8% |
| 5 | 304 | 1.3% |
| Value | Count | Frequency (%) |
| 5 | 304 | 1.3% |
| 4 | 183 | 0.8% |
| 3 | 81 | 0.3% |
| 2 | 3659 | |
| 1 | 4195 | |
| 0 | 3758 |
locality_name
Categorical
| Distinct | 364 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 49 |
| Missing (%) | 0.2% |
| Memory size | 2.5 MiB |
Length
| Max length | 55 |
|---|---|
| Median length | 15 |
| Mean length | 14.22169133 |
| Min length | 4 |
Unique
| Unique | 104 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Санкт-Петербург |
|---|---|
| 2nd row | посёлок Шушары |
| 3rd row | Санкт-Петербург |
| 4th row | Санкт-Петербург |
| 5th row | Санкт-Петербург |
Common Values
| Value | Count | Frequency (%) |
| Санкт-Петербург | 15721 | 66.3% |
| посёлок Мурино | 522 | 2.2% |
| посёлок Шушары | 440 | 1.9% |
| Всеволожск | 398 | 1.7% |
| Пушкин | 369 | 1.6% |
| Колпино | 338 | 1.4% |
| посёлок Парголово | 327 | 1.4% |
| Гатчина | 307 | 1.3% |
| деревня Кудрово | 299 | 1.3% |
| Выборг | 237 | 1.0% |
| Other values (354) | 4692 | 19.8% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| санкт-петербург | 15721 | 54.5% |
| посёлок | 2108 | 7.3% |
| деревня | 945 | 3.3% |
| мурино | 590 | 2.0% |
| поселок | 552 | 1.9% |
| кудрово | 472 | 1.6% |
| шушары | 440 | 1.5% |
| всеволожск | 398 | 1.4% |
| пушкин | 369 | 1.3% |
| типа | 363 | 1.3% |
| Other values (330) | 6913 | 23.9% |
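The word counts above list both "посёлок" and "поселок", so the same settlement type is spelled two ways and some localities may be split across several category values. A minimal normalization sketch, assuming that folding "ё" to "е", lowercasing, and trimming whitespace is acceptable for this column; the function name and the sample strings are illustrative, not taken from the data:

```python
def normalize_locality(name):
    """Lowercase, trim, and fold 'ё' to 'е' so spelling variants compare equal."""
    return name.strip().lower().replace("ё", "е")


# Hypothetical spelling variants of one settlement.
variants = ["посёлок Мурино", "поселок Мурино", " Посёлок Мурино "]
unique = {normalize_locality(v) for v in variants}
print(unique)
```

After normalization the three variants collapse to a single key, which should lower the distinct count of 364 reported for this column.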
airports_nearest
Real number (ℝ≥0)
| Distinct | 8275 |
|---|---|
| Distinct (%) | 45.6% |
| Missing | 5542 |
| Missing (%) | 23.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28793.67219 |
| Minimum | 0 |
| Maximum | 84869 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 11557.4 |
| Q1 | 18585 |
| median | 26726 |
| Q3 | 37273 |
| 95-th percentile | 51340 |
| Maximum | 84869 |
| Range | 84869 |
| Interquartile range (IQR) | 18688 |
Descriptive statistics
| Standard deviation | 12630.88062 |
|---|---|
| Coefficient of variation (CV) | 0.4386686261 |
| Kurtosis | -0.2883133015 |
| Mean | 28793.67219 |
| Median Absolute Deviation (MAD) | 9265 |
| Skewness | 0.5409568907 |
| Sum | 522806706 |
| Variance | 159539145.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37434 | 61 | 0.3% |
| 21928 | 32 | 0.1% |
| 39946 | 30 | 0.1% |
| 44870 | 30 | 0.1% |
| 37407 | 27 | 0.1% |
| 18732 | 27 | 0.1% |
| 39140 | 26 | 0.1% |
| 31744 | 25 | 0.1% |
| 37412 | 24 | 0.1% |
| 19499 | 23 | 0.1% |
| Other values (8265) | 17852 | |
| (Missing) | 5542 | 23.4% |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 6450 | 2 | < 0.1% |
| 6914 | 1 | < 0.1% |
| 6949 | 1 | < 0.1% |
| 6989 | 6 | |
| 6992 | 1 | < 0.1% |
| 6995 | 2 | < 0.1% |
| 7002 | 1 | < 0.1% |
| 7016 | 4 | |
| 7019 | 3 |
| Value | Count | Frequency (%) |
| 84869 | 1 | |
| 84853 | 1 | |
| 84665 | 1 | |
| 84006 | 1 | |
| 83758 | 1 | |
| 81607 | 1 | |
| 81355 | 1 | |
| 78527 | 1 | |
| 75646 | 1 | |
| 73827 | 1 |
cityCenters_nearest
Real number (ℝ≥0)
| Distinct | 7642 |
|---|---|
| Distinct (%) | 42.0% |
| Missing | 5519 |
| Missing (%) | 23.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14191.27783 |
| Minimum | 181 |
| Maximum | 65968 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 181 |
|---|---|
| 5-th percentile | 3541 |
| Q1 | 9238 |
| median | 13098.5 |
| Q3 | 16293 |
| 95-th percentile | 31671.6 |
| Maximum | 65968 |
| Range | 65787 |
| Interquartile range (IQR) | 7055 |
Descriptive statistics
| Standard deviation | 8608.38621 |
|---|---|
| Coefficient of variation (CV) | 0.6065969754 |
| Kurtosis | 4.360911917 |
| Mean | 14191.27783 |
| Median Absolute Deviation (MAD) | 3483.5 |
| Skewness | 1.674916144 |
| Sum | 257997431 |
| Variance | 74104313.14 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8460 | 61 | 0.3% |
| 20802 | 32 | 0.1% |
| 10720 | 30 | 0.1% |
| 8434 | 27 | 0.1% |
| 20444 | 27 | 0.1% |
| 8370 | 26 | 0.1% |
| 10364 | 26 | 0.1% |
| 4836 | 25 | 0.1% |
| 17369 | 24 | 0.1% |
| 13845 | 23 | 0.1% |
| Other values (7632) | 17879 | |
| (Missing) | 5519 | 23.3% |
| Value | Count | Frequency (%) |
| 181 | 1 | < 0.1% |
| 208 | 1 | < 0.1% |
| 215 | 1 | < 0.1% |
| 287 | 1 | < 0.1% |
| 291 | 1 | < 0.1% |
| 318 | 8 | |
| 329 | 1 | < 0.1% |
| 376 | 1 | < 0.1% |
| 387 | 1 | < 0.1% |
| 392 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 65968 | 1 | |
| 65952 | 1 | |
| 65764 | 1 | |
| 65105 | 1 | |
| 64857 | 1 | |
| 62706 | 1 | |
| 62454 | 1 | |
| 61495 | 1 | |
| 60223 | 1 | |
| 59626 | 1 |
parks_around3000
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5518 |
| Missing (%) | 23.3% |
| Memory size | 1.3 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 10106 | 42.6% |
| 1.0 | 5681 | 24.0% |
| 2.0 | 1747 | 7.4% |
| 3.0 | 647 | 2.7% |
| (Missing) | 5518 | 23.3% |
Pie chart (frequencies exclude missing values)
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0 | 10106 | 55.6% |
| 1.0 | 5681 | 31.2% |
| 2.0 | 1747 | 9.6% |
| 3.0 | 647 | 3.6% |
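The two tables above give different percentages for the same counts because they use different denominators: the first divides by all observations, the second only by the non-missing ones. A short sketch that reproduces both from the reported counts (the helper name `shares` is illustrative):

```python
def shares(counts, n_missing=0):
    """Percentage of each value, with missing rows optionally in the denominator."""
    total = sum(counts.values()) + n_missing
    return {value: round(100 * n / total, 1) for value, n in counts.items()}


# Counts reported for this variable; 5518 rows are missing.
counts = {"0.0": 10106, "1.0": 5681, "2.0": 1747, "3.0": 647}
with_missing = shares(counts, n_missing=5518)   # denominator 23699
without_missing = shares(counts)                # denominator 18181
print(with_missing)
print(without_missing)
```

When comparing categories across variables with different missing rates, it helps to state which denominator is in use.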
parks_nearest
Real number (ℝ≥0)
| Distinct | 995 |
|---|---|
| Distinct (%) | 12.3% |
| Missing | 15620 |
| Missing (%) | 65.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 490.804555 |
| Minimum | 1 |
| Maximum | 3190 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 95.9 |
| Q1 | 288 |
| median | 455 |
| Q3 | 612 |
| 95-th percentile | 968 |
| Maximum | 3190 |
| Range | 3189 |
| Interquartile range (IQR) | 324 |
Descriptive statistics
| Standard deviation | 342.3179949 |
|---|---|
| Coefficient of variation (CV) | 0.6974629542 |
| Kurtosis | 12.21768678 |
| Mean | 490.804555 |
| Median Absolute Deviation (MAD) | 163 |
| Skewness | 2.717637751 |
| Sum | 3965210 |
| Variance | 117181.6096 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 441 | 67 | 0.3% |
| 392 | 41 | 0.2% |
| 173 | 41 | 0.2% |
| 456 | 40 | 0.2% |
| 471 | 32 | 0.1% |
| 2102 | 30 | 0.1% |
| 541 | 29 | 0.1% |
| 458 | 29 | 0.1% |
| 447 | 28 | 0.1% |
| 288 | 28 | 0.1% |
| Other values (985) | 7714 | |
| (Missing) | 15620 | 65.9% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 10 | 7 | |
| 11 | 5 | |
| 12 | 1 | < 0.1% |
| 13 | 6 | |
| 14 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3190 | 2 | |
| 3064 | 1 | |
| 3013 | 1 | |
| 2984 | 1 | |
| 2905 | 1 | |
| 2888 | 1 | |
| 2880 | 1 | |
| 2847 | 1 | |
| 2768 | 1 | |
| 2747 | 1 |
ponds_around3000
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5518 |
| Missing (%) | 23.3% |
| Memory size | 1.3 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 2.0 |
| 4th row | 3.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 9071 | 38.3% |
| 1.0 | 5717 | 24.1% |
| 2.0 | 1892 | 8.0% |
| 3.0 | 1501 | 6.3% |
| (Missing) | 5518 | 23.3% |
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 9071 | 49.9% |
| 1.0 | 5717 | 31.4% |
| 2.0 | 1892 | 10.4% |
| 3.0 | 1501 | 8.3% |
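The same counts carry different percentages in the Common Values table and in the pie chart table (for example 8.0% versus 10.4% for the value 2.0). The numbers are consistent with the first table dividing by all observations and the second by non-missing observations only. This is inferred from the figures rather than stated in the report, and the helper name below is illustrative.

```python
def frequency_pct(count, denominator):
    """Share of `count` in `denominator`, as a percentage with one decimal."""
    return round(100 * count / denominator, 1)


# Counts copied from the tables above.
counts = {"0.0": 9071, "1.0": 5717, "2.0": 1892, "3.0": 1501}
missing = 5518

non_missing = sum(counts.values())  # rows that have a value
total = non_missing + missing       # all observations

# Denominator = all observations (Common Values table).
print(frequency_pct(counts["2.0"], total))        # 8.0
# Denominator = non-missing observations (pie chart table).
print(frequency_pct(counts["2.0"], non_missing))  # 10.4
```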
| Distinct | 1096 |
|---|---|
| Distinct (%) | 12.0% |
| Missing | 14589 |
| Missing (%) | 61.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 517.9809001 |
| Minimum | 13 |
| Maximum | 1344 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 93 |
| Q1 | 294 |
| median | 502 |
| Q3 | 729 |
| 95-th percentile | 976.55 |
| Maximum | 1344 |
| Range | 1331 |
| Interquartile range (IQR) | 435 |
Descriptive statistics
| Standard deviation | 277.7206427 |
|---|---|
| Coefficient of variation (CV) | 0.5361600063 |
| Kurtosis | -0.7272670503 |
| Mean | 517.9809001 |
| Median Absolute Deviation (MAD) | 215 |
| Skewness | 0.2220908711 |
| Sum | 4718806 |
| Variance | 77128.75537 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 427 | 70 | 0.3% |
| 454 | 41 | 0.2% |
| 153 | 40 | 0.2% |
| 433 | 39 | 0.2% |
| 564 | 37 | 0.2% |
| 474 | 37 | 0.2% |
| 303 | 36 | 0.2% |
| 440 | 33 | 0.1% |
| 359 | 31 | 0.1% |
| 733 | 30 | 0.1% |
| Other values (1086) | 8716 | 36.8% |
| (Missing) | 14589 | 61.6% |
| Value | Count | Frequency (%) |
| 13 | 2 | < 0.1% |
| 16 | 8 | < 0.1% |
| 19 | 4 | < 0.1% |
| 20 | 5 | < 0.1% |
| 22 | 7 | < 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 7 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 3 | < 0.1% |
| 27 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1344 | 1 | < 0.1% |
| 1341 | 2 | < 0.1% |
| 1337 | 1 | < 0.1% |
| 1313 | 1 | < 0.1% |
| 1299 | 1 | < 0.1% |
| 1293 | 1 | < 0.1% |
| 1278 | 2 | < 0.1% |
| 1275 | 1 | < 0.1% |
| 1271 | 3 | < 0.1% |
| 1270 | 1 | < 0.1% |
| Distinct | 1141 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 3181 |
| Missing (%) | 13.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 180.8886344 |
| Minimum | 1 |
| Maximum | 1580 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 185.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 45 |
| median | 95 |
| Q3 | 232 |
| 95-th percentile | 647 |
| Maximum | 1580 |
| Range | 1579 |
| Interquartile range (IQR) | 187 |
Descriptive statistics
| Standard deviation | 219.7279882 |
|---|---|
| Coefficient of variation (CV) | 1.214714174 |
| Kurtosis | 6.27662008 |
| Mean | 180.8886344 |
| Median Absolute Deviation (MAD) | 68 |
| Skewness | 2.310052616 |
| Sum | 3711473 |
| Variance | 48280.38878 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 45 | 880 | 3.7% |
| 60 | 538 | 2.3% |
| 7 | 234 | 1.0% |
| 30 | 208 | 0.9% |
| 90 | 204 | 0.9% |
| 4 | 176 | 0.7% |
| 3 | 158 | 0.7% |
| 5 | 152 | 0.6% |
| 14 | 148 | 0.6% |
| 9 | 143 | 0.6% |
| Other values (1131) | 17677 | 74.6% |
| (Missing) | 3181 | 13.4% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 158 | 0.7% |
| 4 | 176 | 0.7% |
| 5 | 152 | 0.6% |
| 6 | 124 | 0.5% |
| 7 | 234 | 1.0% |
| 8 | 139 | 0.6% |
| 9 | 143 | 0.6% |
| 10 | 127 | 0.5% |
| Value | Count | Frequency (%) |
| 1580 | 1 | < 0.1% |
| 1572 | 1 | < 0.1% |
| 1553 | 1 | < 0.1% |
| 1513 | 1 | < 0.1% |
| 1512 | 2 | < 0.1% |
| 1497 | 1 | < 0.1% |
| 1489 | 1 | < 0.1% |
| 1485 | 1 | < 0.1% |
| 1484 | 1 | < 0.1% |
| 1477 | 1 | < 0.1% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables and is therefore better than Pearson's r at capturing nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
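The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code the report generator runs: the function names are made up for this example, and tied values are assumed to receive the average of their rank positions.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Covariance of the ranks divided by the product of their standard deviations."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    # The 1/n factors of covariance and variances cancel, so they are omitted.
    return cov / math.sqrt(var_x * var_y)


# A strictly increasing but nonlinear relationship (y = x ** 3).
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))        # essentially 1: the ranks agree perfectly
print(spearman_rho(x, y[::-1]))  # essentially -1: the ranks are reversed
```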
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
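A small sketch of this formula, including the location and scale invariance, is given here. The sample values are total_area and last_price from the first five rows of the First rows table; five points are far too few to say anything about the dataset, so the example only illustrates the mechanics. The function name is illustrative.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


# total_area and last_price of the first five rows of the dataset.
total_area = [108.0, 40.4, 56.0, 159.0, 100.0]
last_price = [13000000.0, 3350000.0, 5196000.0, 64900000.0, 10000000.0]

r_raw = pearson_r(total_area, last_price)

# Rescale price to millions and shift area by a constant: r should not change
# beyond floating-point noise.
price_millions = [p / 1e6 for p in last_price]
area_shifted = [a - 30.0 for a in total_area]
r_rescaled = pearson_r(area_shifted, price_millions)

print(r_raw, r_rescaled)
```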
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
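The pair-counting definition above translates directly into code. The sketch below follows that formula literally (often called tau-a), where tied pairs count as neither concordant nor discordant. Statistical libraries commonly apply a tie correction (tau-b), so their output can differ on data with ties; which variant the report uses is not stated here.

```python
from itertools import combinations


def kendall_tau(x, y):
    """(concordant - discordant) / total pairs, following the text's definition."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables move in the same direction
        elif product < 0:
            discordant += 1   # the variables move in opposite directions
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# One swapped pair out of six: 5 concordant, 1 discordant.
print(kendall_tau([1, 2, 3, 4], [1, 3, 2, 4]))
```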
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | Unnamed: 0 | total_images | last_price | total_area | first_day_exposition | rooms | ceiling_height | floors_total | living_area | floor | is_apartment | studio | open_plan | kitchen_area | balcony | locality_name | airports_nearest | cityCenters_nearest | parks_around3000 | parks_nearest | ponds_around3000 | ponds_nearest | days_exposition |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 20 | 13000000.0 | 108.00 | 2019-03-07T00:00:00 | 3 | 2.70 | 16.0 | 51.00 | 8 | NaN | False | False | 25.00 | NaN | Санкт-Петербург | 18863.0 | 16028.0 | 1.0 | 482.0 | 2.0 | 755.0 | NaN |
| 1 | 1 | 7 | 3350000.0 | 40.40 | 2018-12-04T00:00:00 | 1 | NaN | 11.0 | 18.60 | 1 | NaN | False | False | 11.00 | 2.0 | посёлок Шушары | 12817.0 | 18603.0 | 0.0 | NaN | 0.0 | NaN | 81.0 |
| 2 | 2 | 10 | 5196000.0 | 56.00 | 2015-08-20T00:00:00 | 2 | NaN | 5.0 | 34.30 | 4 | NaN | False | False | 8.30 | 0.0 | Санкт-Петербург | 21741.0 | 13933.0 | 1.0 | 90.0 | 2.0 | 574.0 | 558.0 |
| 3 | 3 | 0 | 64900000.0 | 159.00 | 2015-07-24T00:00:00 | 3 | NaN | 14.0 | NaN | 9 | NaN | False | False | NaN | 0.0 | Санкт-Петербург | 28098.0 | 6800.0 | 2.0 | 84.0 | 3.0 | 234.0 | 424.0 |
| 4 | 4 | 2 | 10000000.0 | 100.00 | 2018-06-19T00:00:00 | 2 | 3.03 | 14.0 | 32.00 | 13 | NaN | False | False | 41.00 | NaN | Санкт-Петербург | 31856.0 | 8098.0 | 2.0 | 112.0 | 1.0 | 48.0 | 121.0 |
| 5 | 5 | 10 | 2890000.0 | 30.40 | 2018-09-10T00:00:00 | 1 | NaN | 12.0 | 14.40 | 5 | NaN | False | False | 9.10 | NaN | городской посёлок Янино-1 | NaN | NaN | NaN | NaN | NaN | NaN | 55.0 |
| 6 | 6 | 6 | 3700000.0 | 37.30 | 2017-11-02T00:00:00 | 1 | NaN | 26.0 | 10.60 | 6 | NaN | False | False | 14.40 | 1.0 | посёлок Парголово | 52996.0 | 19143.0 | 0.0 | NaN | 0.0 | NaN | 155.0 |
| 7 | 7 | 5 | 7915000.0 | 71.60 | 2019-04-18T00:00:00 | 2 | NaN | 24.0 | NaN | 22 | NaN | False | False | 18.90 | 2.0 | Санкт-Петербург | 23982.0 | 11634.0 | 0.0 | NaN | 0.0 | NaN | NaN |
| 8 | 8 | 20 | 2900000.0 | 33.16 | 2018-05-23T00:00:00 | 1 | NaN | 27.0 | 15.43 | 26 | NaN | False | False | 8.81 | NaN | посёлок Мурино | NaN | NaN | NaN | NaN | NaN | NaN | 189.0 |
| 9 | 9 | 18 | 5400000.0 | 61.00 | 2017-02-26T00:00:00 | 3 | 2.50 | 9.0 | 43.60 | 7 | NaN | False | False | 6.50 | 2.0 | Санкт-Петербург | 50898.0 | 15008.0 | 0.0 | NaN | 0.0 | NaN | 289.0 |
Last rows
| | Unnamed: 0 | total_images | last_price | total_area | first_day_exposition | rooms | ceiling_height | floors_total | living_area | floor | is_apartment | studio | open_plan | kitchen_area | balcony | locality_name | airports_nearest | cityCenters_nearest | parks_around3000 | parks_nearest | ponds_around3000 | ponds_nearest | days_exposition |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 23689 | 23689 | 13 | 3550000.0 | 35.30 | 2018-02-28T00:00:00 | 1 | 2.86 | 15.0 | 16.3 | 4 | NaN | False | False | 9.10 | 2.0 | Санкт-Петербург | 17284.0 | 16081.0 | 1.0 | 353.0 | 2.0 | 652.0 | 29.0 |
| 23690 | 23690 | 3 | 5500000.0 | 52.00 | 2018-07-19T00:00:00 | 2 | NaN | 5.0 | 31.0 | 2 | NaN | False | False | 6.00 | NaN | Санкт-Петербург | 20151.0 | 6263.0 | 1.0 | 300.0 | 0.0 | NaN | 15.0 |
| 23691 | 23691 | 11 | 9470000.0 | 72.90 | 2016-10-13T00:00:00 | 2 | 2.75 | 25.0 | 40.3 | 7 | NaN | False | False | 10.60 | 1.0 | Санкт-Петербург | 19424.0 | 4489.0 | 0.0 | NaN | 1.0 | 806.0 | 519.0 |
| 23692 | 23692 | 2 | 1350000.0 | 30.00 | 2017-07-07T00:00:00 | 1 | NaN | 5.0 | 17.5 | 4 | NaN | False | False | 6.00 | NaN | Тихвин | NaN | NaN | NaN | NaN | NaN | NaN | 413.0 |
| 23693 | 23693 | 9 | 4600000.0 | 62.40 | 2016-08-05T00:00:00 | 3 | 2.60 | 9.0 | 40.0 | 8 | NaN | False | False | 8.00 | 0.0 | Петергоф | 45602.0 | 34104.0 | 1.0 | 352.0 | 1.0 | 675.0 | 239.0 |
| 23694 | 23694 | 9 | 9700000.0 | 133.81 | 2017-03-21T00:00:00 | 3 | 3.70 | 5.0 | 73.3 | 3 | NaN | False | False | 13.83 | NaN | Санкт-Петербург | 24665.0 | 4232.0 | 1.0 | 796.0 | 3.0 | 381.0 | NaN |
| 23695 | 23695 | 14 | 3100000.0 | 59.00 | 2018-01-15T00:00:00 | 3 | NaN | 5.0 | 38.0 | 4 | NaN | False | False | 8.50 | NaN | Тосно | NaN | NaN | NaN | NaN | NaN | NaN | 45.0 |
| 23696 | 23696 | 18 | 2500000.0 | 56.70 | 2018-02-11T00:00:00 | 2 | NaN | 3.0 | 29.7 | 1 | NaN | False | False | NaN | NaN | село Рождествено | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 23697 | 23697 | 13 | 11475000.0 | 76.75 | 2017-03-28T00:00:00 | 2 | 3.00 | 17.0 | NaN | 12 | NaN | False | False | 23.30 | 2.0 | Санкт-Петербург | 39140.0 | 10364.0 | 2.0 | 173.0 | 3.0 | 196.0 | 602.0 |
| 23698 | 23698 | 4 | 1350000.0 | 32.30 | 2017-07-21T00:00:00 | 1 | 2.50 | 5.0 | 12.3 | 1 | NaN | False | False | 9.00 | NaN | поселок Новый Учхоз | NaN | NaN | NaN | NaN | NaN | NaN | NaN |